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Abstract 

This paper makes three contributions to cyber-security research. First, we define a model for 
cyber-security systems and the concept of a cyber-security attack within the model’s framework. 

The model highlights the importance of game-over components —critical system components 
which if acquired will give an adversary the ability to defeat a system completely. The model 
is based on systems that use defense-in-depth/layered-security approaches, as many systems 
do. In the model we define the concept of penetration cost, which is the cost that must be 
paid in order to break into the next layer of security. Second, we define natural decision and 
optimization problems based on cyber-security attacks in terms of doubly weighted trees, and 
analyze their complexity. More precisely, given a tree T rooted at a vertex r, a penetrating 
cost edge function c on T, a target-acquisition vertex function p on T, the attacker’s budget 
and the game-over threshold B,G G respectively, we consider the problem of determining 
the existence of a rooted subtree T' of T within the attacker’s budget (that is, the sum of the 
costs of the edges in T' is less than or equal to B) with total acquisition value more than the 
game-over threshold (that is, the sum of the target values of the nodes in T' is greater than or 
equal to G). We prove that the general version of this problem is intractable, but does admit 
a polynomial time approximation scheme. We also analyze the complexity of three restricted 
versions of the problems, where the penetration cost is the constant function, integer-valued, and 
rational-valued among a given fixed number of distinct values. Using recursion and dynamic¬ 
programming techniques, we show that for constant penetration costs an optimal cyber-attack 
strategy can be found in polynomial time, and for integer-valued and rational-valued penetration 
costs optimal cyber-attack strategies can be found in pseudo-polynomial time. Third, we provide 
a list of open problems relating to the architectural design of cyber-security systems and to the 
model. 

Keywords: cyber security, defense-in-depth, game over, information security, layered security, 
weighted rooted trees, complexity, polynomial time, pseudo-polynomial time. 
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1 Introduction 


Our daily life, economic vitality, and a nation’s security depend on a stable, safe, and secure 
cyberspace. Cyber security is so important that the United States (US) Department of Defense es¬ 
tablished the US Cyber Command to take charge of pulling together existing cyberspace resources, 
creating synergy, and synchronizing war-fighting efforts to defend the information-security environ¬ 
ment of the US [23]. Other countries also have seen the importance of cyber security. To name just 
a few in what follows, in response to North Korea’s creation of a cyber-warfare unit, South Korea 
created a cyber-warfare command in December 2009 [23]. During 2010, China introduced its first 
department dedicated to defensive cyber war and information security in response to the creation 
of the US Cyber Command [3j. The United Kingdom has also stood up a cyber force [5|. Other 
countries are quickly following suit. 

Cyberspace has become a new frontier that comes with new opportunities, as well as new risks. 
According to a 2012 study of US companies, the occurrence of cyber attacks has more than doubled 
over a 3-year period while the adverse financial impact has increased by nearly 40 percent [8|. More 
specifically, US organizations experienced an average of 50, 72, and 102 successful attacks against 
them per week in 2010, 2011, and 2012, respectively. In [21] a wide range of cyber-crime statistics 
are reported, including locations of attacks, motivation behind attacks, and types of attacks. The 
number of cyber attacks is increasing rapidly, and for the month of June 2013, 4% of attacks were 
classified as cyber warfare, 8% as cyber espionage, 26% as hacktivism, and 62% as cyber crime 
(see |21]1. Over the past couple of years these percentages have varied significantly from month-to- 
month. In order to respond to cyber attacks, organizations have spent increasing amounts of time, 
money, and energy at levels that are now becoming unsustainable. Despite the amounts of time, 
money, and energy pouring into cyber security, the field is still emerging and widely applicable 
solutions to the problems in the field have not yet been developed. 

A secure system must defend against all possible cyber attacks, including zero-day attacks that 
have never been known to the defenders. But, due to limited resources, defenders generally develop 
defense systems for the attacks that they do know about. Their systems are secure to known attacks, 
but then become insecure as new kinds of attacks emerge, as they do frequently. To build a secure 
system, therefore, requires first principles of security. “In other words, we need a science of cyber 
security that puts the construction of secure systems onto a firm foundation by giving developers 
a body of laws for predicting the consequences of design and implementation choices” |19] . To this 
end Schneider called for more models and abstractions to study cyber security m- In his article 
Schneider suggested building a science of cyber security from existing areas of computer science. In 
particular, he mentioned formal methods, fault-tolerance, cryptography, information theory, game 
theory, and experimental computer science. All of these subfields of computer science are likely to 
be valuable sources of abstractions and laws. 

Cyber security presents many new challenges. Dunlavy et al. discussed what they saw as 
some of the major mathematical problems in cyber security |9|. One of the main challenges is 
modeling large-scale networks using explanatory and predictive models. Naturally, graph models 
were proposed. Some common measures of a graph that such a model would seek to emulate are 
distribution over the entire graph of vertex in-degrees and out-degrees, graph diameter, community 
structure, and evolution of any of the mentioned measures over time |6]. Pfleeger discussed a number 
of useful cyber-security metrics HZ]. She introduced an approach to cyber-security measurement 
that uses a multiple-metrics graph as an organizing structure by depicting the attributes that 
contribute to overall security, and uses a process query system to test hypotheses about each of 
the goals based on metrics and underlying models. Rue, Pfleeger, and Ortiz developed a model- 
evaluation framework that involves making explicit each model’s assumptions, required inputs, and 
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applicability conditions [T8] . 

Complexity science, which draws on biological and other natural analogues, seems under utilized, 
but perhaps is one of the more-promising approaches to understanding problems in the cyber¬ 
security domain [3]. Armstrong, Mayo, and Siebenlist suggested that models of complex cyber 
systems and their emergent behavior are needed to solve the problems arising in cyber security [3]. 
Additionally, theories and algorithms that use complexity analysis to reduce an attacker’s likelihood 
of success are also needed. Existing work in the fields of fault tolerance and high-reliability systems 
are applicable too. Shiva, Roy, and Dasgupta proposed a cyber-security model based on game 
theory [20) . They discovered that their model works well for a dynamically-changing scenario, 
which often occurs in cyber systems. Those authors considered the interaction between the attacks 
and the defense mechanisms as a game played between the attacker and the defender. 

This paper is our response to the call for more cyber-security models in |19] . This work also 
draws attention to the importance of designing systems that do not have game-over components — 
components that are so important that once an adversary has taken them over, one’s system is 
doomed. Since, as we will see, such systems can be theoretically hacked fairly efficiently. We model 
(many known) security systems mathematically and then discuss their vulnerabilities. Our model’s 
focus is on systems having layered security; each security layer possesses valuable assets that are 
kept in containers at different levels. An attacker attempts to break into these layers to obtain 
assets, paying penetration costs along the way in order to break in, and wins if a given game-over 
threshold is surpassed before the attacker’s budget runs out. A given layer of security might be, for 
example, a firewall or encryption. The associated cost of by-passing the firewall or encryption is 
the penetration cost that is used in the model. We formalize the notion of a cyber attack within the 
framework of the model. For a number of interesting cases we analyze the complexity of developing 
cyber-attack strategies. 

The outline of this article is as follows. In Section [2] we define the model for cyber-security 
systems, present an equivalent weighted-tree view of the model, and define natural problems re¬ 
lated to the model. A general decision problem (Game-Over Attack Strategy, Decision Problem 
GOAS-DP) based on the model is proved NP-complete in Section[3l its corresponding optimization 
problem (GOAS-OP) is NP-hard. In sections 01 El and [6] we provide a polynomial-time algorithm 
for solving GOAS-OP when penetration costs are constant, a pseudo-polynomial-time algorithm 
for solving GOAS-OP when penetration costs are integers, a polynomial-time approximation algo¬ 
rithm for solving GOAS-OP in general, and a polynomial-time algorithm for solving GOAS-OP 
when penetration costs are rational numbers from a prescribed finite collection of possible ratio¬ 
nal costs, respectively. As an easy corollary, we obtain a pseudo-polynomial-time algorithm for 
solving an optimization problem on general weighted non-rooted trees. Table [T] summarizes the 
computational results of the paper. Conclusions and open problems are discussed in Section [71 

2 Model for Cyber-Security Systems 

2.1 Basic Setup 

When defining our cyber-security game-over model, we need to strike a balance between simplicity 
and utility. If the model is too simple, it will not be useful to provide insight into real situations; 
if the model is too complex, it will be cumbersome to apply, and we may get bogged down in too 
many details to see the forest from the trees. In consultation with numerous cyber-security experts, 
computer scientists, and others, we have come up with a good compromise for our model between 
ease-of-use and the capability of providing useful insights. 

Many systems contain layered security or what is commonly referred to as defens e-in-depth, 
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Problem Name 

Time 

Class 

GOAS-DP 

- 

NP-complete 

GOAS-OP 

- 

NP-hard 

GOAS-DP constant pc 

O(m^n) 

P 

GOAS-OP constant pc 

0{m?n) 

P 

GOAS-DP integer pc 

0{B^n) 

pseudo-pt 

GOAS-OP integer pc 

0{B^n) 

pseudo-pt 

GOAS-OP approx. 

0((l/e)2n3) 

P 

GOAS-DP rational pc 


P 

GOAS-OP rational pc 


P 


Table 1: Summary of results about the cyber-security model contained in the paper. Note that in 
the table “pc” stands for “penetration cost,” and “pseudo-pt” stands for pseudo-polynomial time. 
The values of m, n, B, and d are as given in the respective theorems. 


where valuable assets are hidden behind many different layers or secured in numerous ways. For 
example, a host-based defense might layer security by using tools such as signature-based vendor 
anti-virus software, host-based systems security, host-based intrusion-prevention systems, host- 
based firewalls, encryption, and restriction policies, whereas a network-based defense might provide 
defense-in-depth by using items such as web proxies, intrusion-prevention systems, firewalls, router- 
access control lists, encryption, and filters [13]. To break into such a system and steal a valuable 
asset requires several levels of security to be penetrated. Our model focuses on this layered aspect 
of security and is intended to capture the notion that there is a cost associated with penetrating 
each additional level of a system and that attackers have finite resources to utilize in a cyber attack. 
We also build the concept of critical game-over components. 

2.2 Definition of the Cyber-Secnrity Game-Over Model 

Let N = {1, 2,3,...}, Q be the rational numbers, and Q"'' be the positive rational numbers. With 
the intuition provided in the previous section in mind, we now present the formal definition of the 
model. 

Definition 2.1. A cyber-security game-over model M is a six-tuple (T, C, T>, C, B, G), where 

1. The set T = {ti,t 2 , ■ ■ ■, tfc} is a eollection o/targets, where fe G N. The value k is the number 
of targets. Corresponding to eaeh target ti, for 1 <i <k, is an associated target acquisition 
value v{ti), where v{ti) G Q. We also refer to the target acquisition value as the acquisition 
value for short, or as the reward or prize. 

2. The set C = {ci, C 2 ,..., q} is a eollection o/containers, where I G N. The value I is the 
number of containers. Corresponding to each container Ci, for 1 < i < I, is an associated 
penetration cost p(cj), where p{ci) G Q. 

3. The set V = {Ci, C 2 ,. ■ ■, Ci} is the set 0 /container nestings. The tuple Ci, for 1 < i < I, is 
called the penetration list for container Ci and is a list in left-to-right order of containers that 
must be penetrated before Ci can be penetrated. If a container Ci has an empty penetration list, 
and its cost p{ci) has been paid, we say that the container has been penetrated. If a container 
Ci has a non-empty penetration list and each container in its list has been penetrated in left- 
to-right order, and its cost p{ci) has been paid, we say that the container has been penetrated. 
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The number of items in the tuple Ci is referred to as the depth of penetration required for Cj. 
If container Cj appears in cfs tuple Ci, we say that container Ci is dependent on container Cj. 
If there are no two containers Ci and Cj such that container Ci is dependent on container Cj 
and container Cj is dependent on container Ci, then we say the model is well-formed. 

4 . The set C = {li,l 2 , ■ ■ ■, Ik} is a list of container names. These containers specify the level-1 
locations of the targets. For 1 <i <k if target ti has level-1 location li, this means that there 
is no other container c such that container c is dependent on container li and container c 
contains target ti. Target ti is said to be located at level-1 in container h. The target ti is also 
said to he located in container li or any container on which container li is dependent. When 
a target’s level -1 container has been penetrated, we say that the target has been acquired. 

5. The value B ^ Q is the attacker’s budget. The value represents the amount of resources that 
an attacker can spend on a cyber attack. 

6 . The value G G Q is the game-over threshold signifying when critical components have been 
acquired. 

The focus of this paper is on cyber-security game-over models that are well-formed, which are 
motivated by real-world scenarios. In the next section we introduce a graph-theoretic version of 
the model using weighted trees. 

Remarks: (i) In part 3 of the definition we refer to the cost of a container Ci being paid. By 
this we simply mean that p{ci) has been deducted from the remaining budget, B', and we require 
that B' —p(ci) > 0. (ii) In part 4 of the dehnition we maintain a general notion of containment for 
targets by specifying the inner-most container in which a target is located. Although containers 
can have partial overlap, we require that the inner-most container be unique. In the next dehnition 
we formalize the notion of a cyber-security attack strategy. 

Definition 2.2. A cyber-security attack strategy in a cyber-security game-over model M is a list 
of containers ci,C 2 ,... ,Cr from M. The cost of an attack strategy is ^ valid attack 

strategy is one in which the penetration order is not violated. A game-over attack strategy in a 
cyber-security game-over model M is a valid attack strategy ci,C 2 , ■. ■ ,Cr whose cost is less than 
or equal to B and whose total target acquisition value such a game-over 

attack strategy in a cyber-security game-over model a (successful) cyber-security attack or cyber 
attack for short. 

Note that this notion of a cyber attack is more general than some, and, for example, espionage 
would qualify as a cyber attack under this dehnition. The dehnition does not require that a service 
or network be destroyed or disrupted. Since many researchers will think of Dehnition 12.11 from a 
graph-theory point of view, in the next section we offer that perspective. As we will soon see, the 
graph-theoretic perspective allows us to work more easily with the model mathematically and to 
relate to other known results. 

2.3 Game-Over Model in Terms of Weighted Trees 

In this section we describe the (well-formed) game-over model in terms of weighted trees. The set 
T> of nested containers in Dehnition 12.11 has a natural rooted-tree structure, where each container 
corresponds to a vertex that is not the root, and we have an edge from a parent u down to a child 
V if and only if the corresponding container c{u) includes the container c(u) in it. The weight of an 
edge from a parent to a child represents the cost of penetrating the corresponding container. The 
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weight of a vertex represents the acquisition value/prize/reward obtained by penetrating/breaking 
into that container. 

Sometimes we do not distinguish a target from its acquisition value/prize/reward nor a container 
from its penetration cost. We can assume that the number of containers and targets is the same. 
Since if we have a container housing another container (and nothing else), we can just look at this 
“double” container as a single container of penetration cost equal to the sum of the two nested 
ones. Also, if a container contains many prizes, we can just lump them all into a single prize, which 
is the sum of them all. The following is a graph-theoretic version of Dehnition 12.11 

Definition 2 . 3 . A cyber-security (game-over) model (CSM) M is given by an ordered five tuple 
M = {T,c,p, B,G), where T is a tree rooted at r having n E N non-root vertices, c : E{T) —>■ Q 
is a penetration-cost weight function, p : V{T) Q is the target-acquisition-value weight function, 
and B,G ^ are the attacker’s budget and the game-over threshold value, respectively. 

Remarks: (i) Note that V{T) = {r,ui,... ,Un}, where r is the designated root that indicates 
the start of an attack, (ii) In most situations we have the weights c and p being non-negative 
rational numbers, and p{r) = 0. 

Recall that in a rooted tree T each non-root vertex u E V{T) has exactly one parent. We let 
e{u) E E{T) denote the unique edge connecting u to its parent. For the root r, we let e(r) be the 
empty set and c(e(r)) be 0. For a tree T with u E V{T), we let T{u) denote the (largest) subtree 
of T rooted at u. It is easy to see the correspondence between Definitions 12.11 and 12.31 Analogously 
to Definition Ea we next define a cyber-security attack strategy in the weighted-tree model. 

Definition 2 . 4 . A cyber-security attack strategy (CSAS) in a CSM M = {T,c,p, B,G) is given 
by a subtree T' of T that contains the root r ofT. 

• We define the cost of a CSAS T' to be c{T') = Y1u&v{t>) 

• We define a valid CSAS (VCSAS) to be a CSAS T' with c{T') < B. 

• We define the prize of a CSAS T' to be p{T') = YlueV{T')Pi'^)- 

A game-over attack strategy (GOAS) in a CSM M = {T,c,p, B,G) is a VCSAS T' withp(T') > G. 
IFe sometimes refer to such a GOAS simply as a cyber-security attack or cyber attack for short. 

Note that in Definition 12.41 we use c (resp. p) to denote the total cost (respectively, total prize) 
of a cyber-security attack strategy. We also use c (resp. p) as the penetration-cost weight function 
(respectively, target-acquisition-value weight function). The overloading of this notation should not 
cause any confusion. Throughout the remainder of the paper, we will use Dehnitions 12.31 and 12.41 

2.4 Cyber-Attack Problems in the Game-Over Model 

We now state some natural questions based on the CSM. 

Problem 2 . 5 . Given: A cyber-security model M = {T,c,p, B,G). 

• Game-Over Attack Strategy, Decision Problem (GOAS-DP): 

Is there a game-over attack strategy in M? 

• Game-Over Attack Strategy, Optimization Problem (GOAS-OP): 

What is the maximum prize of a valid game-over attack strategy in M? 

Needless to say, some special cases are also of interest, in particular, in Problems 12.51 when c 
is (i) a constant rational function, (ii) an integer-valued function, or (iii) takes only finitely many 
given rational values. We explore the general GOAS and these other questions in the following 
sections. 
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2.5 Some Limitations of the Model 


Our model is a theoretical model. It is designed to give us a deeper understanding of cyber attacks 
and cyber-attack strategies. Of course, a real adversary is not in possession of complete knowledge 
about a system and its penetration costs. Nevertheless, it is interesting to suppose that an adversary 
is in possession of all of this information, and then to see what an adversary is capable of achieving 
under these circumstances. Certainly an adversary with less information could do no better than 
our fully informed adversary. 

We are considering systems as they are. That is, we are given some system, targets, and 
penetration costs. If the system is a real system, we are not concerned about how to improve the 
security of that system per se. We assume that the system is already in a hardened state. We then 
examine how difficult it would be to attack such a system. We do not examine the question of 
implementations of a system. Our model can be used on any existing system. Some real systems 
will have more than one possible path to attack a target. And, in the future it may be worth 
generalizing the model to structures other than trees. The first step is to look at trees and derive 
some insight from these cases. 

We have purposely chosen a target acquisition function which is simple. That is, we merely 
add together the total costs of the targets acquired. Studying this simple acquisition function is 
the first step. It may be interesting to study more-complex acquisition functions in the future. For 
example, one can imagine two targets that in and of themselves are of no real value, but when the 
information contained in the two are combined they are of great value. In some cases our additive 
function can capture this type of target depending on the structure of the model. 

We describe the notion of a game-over component. In the model this concept is an abstract 
one. A set of components whose total value exceeds a given threshold comprise a “game-over 
component.” A game-over component is not necessarily a single target although one can think of 
a high-cost target, which is included as a target in a set of targets that push us over the game-over 
threshold, as being the game-over component. 

For easy reference, the following table contains our most common abbreviations, their spelled 
out meaning, and where they are defined. 


CSM 

cyber-security (game-over) model 

Def.O 

CSAS 

cyber-security attack strategy 

Def.El 

VCSAS 

valid cyber-security attack strategy 

Def.El 

GOAS 

game-over attack strategy 

Def.El 

GOAS-DP 

game-over attack strategy, decision problem 

Def.123] 

GOAS-OP 

game-over attack strategy, optimization problem 

Def.ES] 


Table 2: Abbreviations we use throughout the paper, all defined in this section. 


3 Complexity of Cyber-Attack Problems 

In this section we show that the general game-over attack strategy problems are intractable, that 
is, highly unlikely to be amenable to polynomial-time solutions. Consider a cyber-security attack 
model M, where T is a star centered at r having n leaves ui,... ,Un- Since each cyber-security 
attack T' of M can be presented as a collection E' C E{T) of edges of T, and hence also as a 
collection of vertices V' C V{T) by T' = T[{r} U V], and vice versa, each collection of vertices 
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V' (T) can be presented asV' = V{T') for some cyber-secnrity attack T' of M, and the GOAS- 
DP is exactly the decision problem of the 0/1-Knapsack Problem [TO], and the GOAS-OP is 
the optimization problem of the Knapsack Problem. Note that the 0/1-Knapsack Problem 
is usually stated using natural numbers as weights, but clearly the case for weights consisting of 
rational numbers is no easier to solve yet still in NP. So, we have the following observation. 

Observation 3.1. The GOAS-DP is NP-complete; the GOAS-OP is an NP-hard optimization 
problem. 

Remark: Observation ED answers an open qnestion in the last section of m. where it is 
asked whether or not the LST-Tree Problem can be solved in polynomial time (we presume) for 
general edge lengths. Observation 13.11 is similar to [71 Theorem 2], where also a star is considered 
to show that their SubtreeE is as hard as Knapsack. 

Notice that the NP-completeness of GOAS-DP is a donble-edge sword. It suggests that even 
an attacker who has detailed knowledge of the defenses of a cyber-security system would find the 
problem of allocating his (attack) resources difficult. On the other hand, the NP-completeness also 
makes it difficult for the defender to assess the security of his system. However, we will see in 
Section [5l that if we allow a slight proportional increase of the attacker’s budget B to an amount 
of (1 -|- e)B for an e > 0, then GOAS-OP admits a polynomial time approximation scheme, so it 
can be solved in time polynomial in n and 1/e. 

Sections mu and El consider the complexity of cyber-security attacks where c is a constant¬ 
valued cost function, an integer-valued cost function, and a rational-valued cost function of finitely 
many possible values, respectively. In Section [5l as mentioned, we also obtain an approximation 
algorithm for solving GOAS-OP, and a solution on general weighted non-rooted trees. In all cases 
we are able to give reasonably efficient algorithms for solving GOAS-OP. 

4 Cyber Attacks with Constant Penetration Costs 

In this section we show that if all penetration costs have the same value then the Game-Over 
Attack Strategy Problems can be solved efficiently in polynomial time. Consider a GSM M, 
where c is a constant function taking a constant rational value c(e) = c for each e € E(T). That 
is, all penetration costs are a fixed-rational value. This variant is the first interesting case of the 
GOAS-DP and GOAS-OP, as there are related problems and solutions in the literature. One of 
the first papers on maximum-weight subtrees of a given tree with a specific root is [I], where it is 
shown that the rooted subtree problem, that is, to find a maximum-weight subtree with a specific 
root from a given set of subtrees, is in polynomial time if, and only if, the subtree packing problem, 
that is, to find maximum-weight packing of vertex-disjoint subtrees from a given set of subtrees 
(where the value of each subtree can depend on the root), is in polynomial time. In more-recent 
papers the weight-constrained maximum-density subtree problem (WMSP) is considered: given a 
tree T having n vertices, and two functions I, w : E[T) —)■ Q representing the “length” and “weight” 
of the edges, respectively, determine the subtree T' of T such that / YleeE{T') is 

a maximum, subject to Yle&E{T') 'w(e} having a given upper bound. In [13] an 0(u)inax^)-time 
algorithm is given to solve the related, and more restricted, weight-constrained maximum-density 
path problem (WMPP), as well as an algorithm to solve the WMSP. In [15] an 

0(n[/^)-time algorithm is given for the WMSP, where U is the maximum total length of the subtree, 
and in [22] an 0(nt/lgn)-time algorithm for the WMSP is given, which is an improvement in the 
case when U = D(lgn). The WMSP has a wide range of practical applications. In particular, the 
related WMPP has applications in computational biology m, and the related weight-constrained 


least-density path problem (WLPP) also has applications in computational biology, as well as in 
computer, traffic, and logistic network designs m- 

The WMSP is similar to our problem, and some of the same approaches used in m, IS], 
and [22] can be applied in our case, namely the techniques of recursion and dynamic programming. 
There are not existing results that apply directly to our problems. Note that there is a subtle 
difference between our GOAS-OP and the WMSP, as a maximum-weight subtree (that is, with 
the prize p{T') a maximum) might have low density and vice versa; a subtree of high density might 
be “small” with low total weight (that is, prize). 

In [7] a problem on trees related to the Traveling Salesman Problem with profits is studied, which 
is similar to what we do. Both here and in [7] the most general form of the problems considered, 
in our case GOAS-DP in Observation 13.II and in their case (as mentioned above) SubtreeE in [71 
Theorem 2], are observed to be as hard as Knapsack and hence NP-complete. Also, the results 
of fixed costs, in our case Theorem and in their case [71 Theorem 3], the problems are shown 
to be solvable in 0(n) time, given certain conditions. Theorem 14.21 however, provides a precise 
accounting for the time complexity and for certain values of m, defined there, our algorithm would 
be faster than that given in [7[. Their work is not in the context of cyber-security, and does not 
handle cases as general as this work. 

For a CSM M, where c is a constant function, we first note that T' is a VCSAS if and only 
if m = |£'(T')| < \B/c\. Hence, in this case the GOAS-OP reduces to finding a CSAS T' with 
at most m edges having p{T') at a maximum. Note that if m > n, then the GOAS-OP is trivial 
since T' = T is the optimal subtree. Hence, we will assume the budget B is such that m < n. 

In what follows, we will describe our dynamic programming setup to solve GOAS-OP in this 
case. The core of the idea is simple: we construct a 2 x 2 matrix for each vertex u in the tree T 
that stores the maximum prize of a subtree rooted at u on at most k edges and that contains only 
the rightmost d{u) — i + 1 branches from u, for each A: G {1,..., m} and z G {1,..., d{u)}. 

More specifically, we proceed as follows. We may assume that our rooted tree T has its vertices 
ordered from left-to-right in some arbitrary but fixed order, that is, T is a planted plane tree. Since 
T has n > 1 non-root vertices and n-\-l vertices total, we know by a classic counting exercise [2] that 
the number of planted plane trees on n-|-l vertices is given by the Catalan numbers Cn by obtaining 
a dehning recursion for Cn by decomposing each planted plane tree into two rooted subtrees. Using 
this decomposition, we introduce some notation. For a subtree r of T rooted at u G U(T) denote 
by t{v) the largest subtree of r that is rooted at a vertex u (if u G T[U(r)]). Denote by the 
leftmost child of n in r (if it exists). Let m = t{uii) denote the subtree of r generated by ui, that 
is, the largest subtree of T rooted at u^. Finally, let t" = r — V{t() = T[U(r) \ V{t^)] denote the 
subtree of r generated by the vertices not in ti. In this way we obtain a decomposition/partition 
of the planted plane tree r into two vertex-disjoint subtrees and t" whose roots are connected 
by a single edge e{ui). In particular, for each vertex u G V{T), we have a partition of T{u) into 
T{u)i = T{u() and T{u)", which we will denote by T"{u) (that is T{u)'' = T"{u)). Note that 
if n is a leaf, then T{u) = T"{u) = {u} and U£ = T{u£) = 0. Also, if u has exactly one child, 
which therefore is its leftmost child Ui, then T{u) is the two-path between u and its only child U£, 
T"{u) = {rt}, and T{u£) = {u£]. Assuming the degree of u is d{u), we can recursively dehne the 
trees T^(u),... by 


T\u) = T{u), 
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For each vertex u € V{T), we create a d{u) x (m + 1) rational matrix as follows: 



■ M^iu) 

Ml{u) ■ 


M(m) = 

M^{u) 

ufiu) ■ 





•• M^^\u) _ 


where M^{u) is the maximum prize of a subtree of T^{u) rooted at u with at most k edges for 
each i € {1, • • • ,d{u)} and k G {0,1,... ,m}. In particular, Mq{u) = p{u) for each vertex u and 
i G {1,..., d{u)}. For each leaf u of T, and each i and k, we set M'^{u) = p{u), and for each internal 
vertex u we have a recursion given in the following way: for a vertex u and an arbitrary subtree r 
rooted at u, we let Mk{u]T) be the maximum prize of a subtree of r rooted at u having k edges 
or 0 if vertex u does not exist. If a maximum-prize subtree of r with k edges does not contain 
the edge from u to its leftmost child ug, then Mk{u]T) = Mk{u;T"). Otherwise, such a maximum 
subtree contains i — 1 edges from ti and k — i edges from t". The following lemma is easy to show. 

Lemma 4.1. The arbitrary subtree r rooted at u is a maximum-prize subtree with at most k edges 
that contains the leftmost child U£ of u if and only if the included subtree of ti is a maximum-prize 
subtree with at most i — 1 edges rooted at ui and the included subtree of r” is a maximum-prize 
subtree with at most k — i edges rooted at u for some i G {!,... ,A:}. 

By Lemma l4.ll we therefore have the following recursion: 

Mk{u\T) = max ( Mfc(u;r"), max [Mi_i{uf,Ti) + Mk_i{u]T''))] . (1) 

y l<i<fc J 

Since now Ml{u) = Mk{u]T^{u)) for each i and k, we see that we can compute each Ml{u) from 
the smaller M’s as given in ([T|) using 0(A;)-arithmetic operations. Because k G {0,1,... ,m}, this 
fact means in 0(m)-arithmetic operations. Since we assume each arithmetic operation takes one 
step, we have that each M^{u) can be computed in 0(m)-time given the required inputs. Therefore, 
M(ti) can be computed in d{u)m-0{m) = d(n)0(m^)-time. Performing these calculations for each 
of the n vertices of our given tree T, we obtain by the Handshaking Lemma a total time of 

t{n) = ^ d{u)0{mf) = 0 {mf) ^ d{u) = 0 {m?)2{n — 1) = 0 {mfn). 

u£V(T) u£V{T) 

We finally compute a maximum prize VCSAS T' in M by p{T') = M^{r) for the root r of T. We 
conclude by the following theorem. 

Theorem 4.2. If M = (T,c,p, B,G) is a CSM, where T has n vertices, c is a constant function, 
and m = [H/cJ then the GOAS-OP can be solved in 0{m?n)-time. 

Remarks: (i) Note that Theorem 14.21 is similar to [71 Theorem 3]. (ii) Also note that the 
overhead constant is “small”: for each vertex u, each k, and each i by © each of Ml{u) = 
Mi:(u;T^{u)) uses exactly 2k arithmetic operations, namely k additions and k comparisons. Hence, 
the exact number of arithmetic operations can, by the Handshaking Lemma, be given by 

m m 

N{n,m)= ^ ^^d{u){2k) = ^ d{u) 2k = 2\E{T)\m‘^ = 2{n — l)m^. 

ueV{T) k=0 ueV{T) k=0 

We obtain an overhead constant of two. Since we assumed the budget given is such that m < n, 
we see that the GOAS-OP can be solved in 0{n^) time. 

Corollary 4.3. The GOAS-DP when restricted to constant-valued penetration costs can be solved 
in O(n^) time and is in P. 
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5 Cyber Attacks with Integer Penetration Costs and an Approx¬ 
imation Scheme 

In this section we show that if all penetration costs are non-negative integers then the Game-Over 
Attack Strategy Problems can be solved in pseudo-polynomial time. We will then use that 
to obtain a polynomial time approximation algorithm. 

5.1 Integer valued cost 

Consider now a CSM M = (T, c,p, B, G), where c is a non-negative integer-valued function, that is, 
c(e) € {0,1, 2,...} for each e G E{T). Note that we can contract T by each edge e with c(e) = 0, 
thereby obtaining a tree for our CSM M, where c : E[T) —>■ N takes only positive-integer values. We 
derive a polynomial-time algorithm in terms of n and B to solve the GOAS-OP. We can assume 
B is an integer here as well since otherwise we could just replace B with [B\. To produce our new 
algorithm we will tweak the argument given in Section [J] for the case when the cost function c is a 
constant. 

Using the same decomposition of a subtree r of T into U£ and t" for our dynamic programming 
scheme, for each vertex u we will assign, as before, a d{u) x (i? -|- 1) integer matrix as follows: 



r Kiu) 

Nl{u) ■ 

•• N^u) 1 

N(n) = 

Niiu) 

Nf{u) • 

•• Nl{u) 


1 


-1 

cq 


where N^iu) is the maximum prize of a subtree of T^{u) rooted at u of total cost at most k for each 
z G {1,..., d{u)} and A; G {0,... , B}. As before, we have Nq{u) = p{u) for each vertex u. Similarly 
to Lemma liTl we obtain the following. 

Lemma 5.1. The arbitrary subtree r rooted at u is a maximum-prize subtree of total cost at most 
k that contains the leftmost child ui of u if and only if the included subtree of ti is a maximum- 
prize subtree of total cost at most i — c(e(u£)) rooted at U£ and the included subtree of r" is a 
maximum-prize subtree of total cost k — i rooted at u, for some i G {c(e(u£)),..., k}. 

Using similar notation and definitions as in Section 01 by Lemma 15.11 we get the following 
recursion: 

iVfe(u; r) = max ( iV^(n; r"), ^ max {Ni_c{e(ut)){uT n) + Efk-i{u] r")) I , (2) 

\ c{e{ui))<i<k J 

and we obtain similarly the following. 

Theorem 5.2. If M = {T,c,p, B,G) is a CSM, where T has n vertices and c : E{T) —>■ N takes 
only positive-integer values, then the GOAS-OP can be solved in 0{B‘^n)-time. 

Remark: (i) Although we are not able to obtain a compact expression for the exact number 
of arithmetic operations that yield Theorem 15.21 the bound N{n, B) = 2(n — 1)B^ still is an upper 
bound, as for Theorem 14.21 (ii) Note the assumption that c is an integer-valued cost function is 
crucial, since otherwise, we would not have been able to use the recursion ([2]) in at most B steps. 

Corollary 5.3. The GOAS-DP when restricted to integer-valued penetration costs can be solved 
in pseudo-polynomial time. 
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5.2 Approximation Scheme 

We now can present a polynomial time approximation scheme (PTAS) for solving the GOAS-OP 
from Problem 12.51 In Observation 13.11 we saw that the GOAS-OP is an NP-hard optimization 
problem. But this is not the whole story; although it is hard to compute the exact solution, one 
can obtain a polynomial time approximation algorithm if we allow slightly more budget for the 
attacker than he/she wants to spend. We will in this section describe one such approximation 
scheme. Our approach here is similar to the PTAS for the optimization of the 0/1-Knapsack 
Problem presented in the classic text m Section 17.3]. 

We saw in Theorem 15.21 that GOAS-OP can be solved in 0(B^n)-time, if the cost is integer 
valued and B is the budget of the attacker. So for large B this can be far polynomial time. For 
each fixed t G N we can write the integer cost c(e) of each edge e € E{T) as 

c(e) = Cg{e) + Cr{e), where Cr{e) = c(e) mod 2*, (3) 


that is, we obtain a new cost function Cq by ignoring the last t digits of c(e) when it is written as 
a binary number. Since each Cq is divisible by 2*, solving GOAS-OP for Cq and budget B is the 
same as solving it for the cost function 2“*Cq and budget 2~^B. Therefore, we can by Theorem 15.21 
solve the GOAS-OP for this new cost function Cq in 0((2“*B)^n)-time. 

Let T' (resp. T/) be an optimal GOAS-OP subtree of T w.r.t the cost c (resp. Cg), so p{T') 
is maximum among subtrees with c-weight < B, and p{T^) is maximum among subtrees with 
Cq-weight < B. In this case we have 

c(r') = Cq(r') + c,(r') <B + \EiT ')\• 2 * < B + n2\ (4) 

Also, since Cq{T') < c{T') < B we have by the definitions of T' and T/ that p{T') < p{T'^). Therefore 
if there is a GOAS T' w.r.t. the cost c, then there certainly is one w.r.t. the cost Cq, namely Tq. 
Hence, if e = then we obtain from ([4]) that c{Tq) < (1 -|- e)B and T/ is here definitely a GOAS 
that further can be computed in 0((n/e)^n) = 0((l/e)^n^)-time. Conversely, for a given e > 0, we 
obtain such an approximation algorithm by considering the cost Cg defined by ([3]) where 


t = 



(5) 


We therefore have the following. 

Theorem 5.4. The GOAS-OP admits a polynomial time approximation scheme; for every e > 0 
a GOAS T' of cost of at most (1 -I- e)B can be computed in 0((l/e)^n^)-tzme. 

Remarks: (i) In establishing the above Theorem 15.41 we started with an integer cost function 
c ; E{T) —)■ N. The same approach could have been used for a rational cost function c : E{T) —>■ Q 
where c(e) has d binary binary digits after its binary point (i.e. radix point when written as a 
rational number in base 2.) By considering a new integer valued cost function c' : E{T) —N, 
where d{e) = 2'^c(e) for each e G E(T), we can in the same manner as used above, obtain an 
approximation algorithm where we replace B with B' = 2‘^B. Needless to say however, in this 
case the corresponding cost function is obtained by truncating or ignoring only t — d oi the 
digits of c' (instead of the t digits of c), to obtain a solution using a budged of (1 -|- e)i?. (ii) 
Further along these lines, if the cost function c : E{T) —>■ Q is given as a fraction c(e) = a(e)/5(e), 
where a(e),b{e) G N are relatively prime, we can let M be the least common multiple of the 6(e) 
where e G E{T) and obtain by scaling by M a new integer valued cost function c" : E{T) N 
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where c'^(e) = Mc{e) for each e € E{T). Again, since c" is integer valued we can in the same 
manner obtain an approximation algorithm where we replace B with B" = MB. In this case 
the corresponding cost function c" is obtained by truncating or ignoring even fewer digits, namely 
t — \gM of the digits of c”. This will also yield a polynomial time approximation algorithm in 
terms of n and 1/e despite the fact that M can become very large (i.e. if all the costs have pairwise 
relatively prime denominators h{e).) 

5.3 General Weighted Trees 

In our framework a CSM M is presented as a rooted tree provided with two weight functions: one 
on the vertices and one on the edges. In the model the root serves merely as a starting vertex and 
does not (usually) carry any weight (that is, has no prize attached to it). However, given a general 
non-rooted tree T provided with two edge-weight functions 'w,w' : E{t) ^ Q, we can always add a 
root to some vertex and then push the weights of one of the weight functions, say w down to the 
unique vertex away from the root. In this way we obtain a CSM M to which we can apply both 
Theorems 14.21 and 15.21 With this slight modification, we have the following corollary for general 
weighted trees. 

Corollary 5.5. Let T be a tree on n vertices, w,w' : E{T) —>■ Q two edge-weight functions, and 
B, G two rational numbers. If the function w is either (i) a rational constant c ^ Q or (ii) integer¬ 
valued, then the existence of a subtree T' ofT such that w'{T') < B and w{T') is a maximum can 
be determined in 0{mf‘n)-time, where m = [H/cJ in case (i), and in 0{B‘^n)-time in case (ii). 

6 Cyber Attack with Rational Penetration Costs 

In this section we consider the more-general case of a CSM M = {T,c,p, B,G) where the cost 
function c : E{T) —>■ Q takes at most d distinct rational values, say ci,... ,Crf G Q. This case can 
model quite realistic scenarios, as there are currently only a hnite number of known encryption 
methods and cyber-security designs, where a successful hack for each method/design has a specific 
penetration cost. As in previous sections, we will utilize dynamic programming and recursion 
based on the splitting of a subtree r of a planted plane subtree into two subtrees ti and t" as in 
m and ([2]). However, here we are dealing with rational-cost values (i.e. arbitrary real values from 
all practical purposes), and that the we are able to obtain a polynomial time procedure in this case 
is not as direct. 

Note that if M is the least common multiple of all the denominators of ci,...,Crf, then by 
multiplying the cost and the budget of the attacker through by M, we obtain an integer valued 
cost function Me, which then can by Theorem 15.21 be solved pseudo polynomially in 0{M‘^B‘^n)- 
time. Our goal here in this section, however, is to develop an algorithm to solve GOAS-OP in 
time polynomial in n alone. 

For each i G d}, let Uj = \{e G E{T) : c(e) = Cj}|, and so = n = \E{T)\ = 

|H(r)| — 1. Let H = {0,1,... ,ni} x • • • x {0,1,... ,nd} C and note that \B\ = nf=i(^* + !)• 
Denote a general d-tuple of by x = (xi,..., x^), and let x < y denote the usual component-wise 
partial order Xi < yi, for each i G {!,... ,d}. If c = (ci ,... ,Cd) G is the rational-cost vector, 
let C = {x G : X > 0, c • x < B} C denote the d-dimensional pyramid in with the d -|- 1 
vertices given by the origin 0 = (0,..., 0) and (0,..., H/cj,..., 0), where i G {1,... , d}. To estimate 
the number of non-negative integral points in C, we count the number of unit d-cubes within the 
pyramid C. Since [xj < x < [xj -|- 1 for each rational x, then each x G C is contained in the unit 
d-cube with the line segment from [xJ = ([xij,..., [x^J) to [xJ -|-1 = (-|- 1,... , [x^J -|- 1) as its 
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diagonal. Since c-x < B, then c - ([xj +1) < B ->rYli=i Ci, and hence, the number of integral points 
in C is at most the volume V{C') of the associated pyramid C' = {x € : x > 0, c-x < B'} C 

where B' = B + Yli=i that is, at most \y{C')\, where 


B' 




i=l 


1 

li\ 




2 = 1 


Ci 


Note that a CSAS T' of a CSM M has fc* edges of cost Cj for each i if and only if € -B D C'. 

Definition 6.1. For each i let m* = min(|'B'/cj],nj), and let m = 

Remark: Note that we have m = Ef=i — Ef=i therefore any upper bound 

polynomial in m will yield a bound in the same polynomial in terms of n. 

If C" = {0,1,..., [RVci]} X ■ ■ • X {0,1,... , \B'/cd \}, then C' n C C", and 


B n C' = B n (C' n Z'^) C B n C" = {0,1,... , mi} x • • • x {O, l,... , m^j (6) 


Hence, by the Inequality of Arithmetic and Geometric Mean (lAGM), we get 


|B n C'\ < |B n C''\ = Wirrn +1) < 


2=1 


EtiK + 1) 

d 


/m \'i 

= '7 + ' 


We summarize in the following. 


Observation 6.2. If M is a CSM with Ui edges of cost Ci for each i £ {1,..., d}, then |B n C'j < 
{m/d + 1)“^, which is a polynomial in m = Ef=i degree d. 


Remark: Note that if B'/ci < Ui for each i, then m* = min(|'B'/ci],nj) = \B'/c/]. In this case 
we have C' fl Z*^ C B and so C' n Z*^ = C' n Z*^ n B = C' H B, and so again by the lAGM, we obtain 


IBnc'l = ic'nz'^i < L^(c')J = 




i=l 


< 


1 

a 


m \d. 

7 + ' 


where now m = '^i=i\B'/ci\, which shows that, although polynomial in m of the same degree d 
as in Observation E21 the number of possible k £ 13 nC is a much smaller fraction of {m/d + 1)“^. 

We now proceed with our setup for our dynamic programming scheme. As before, the idea is 
simple; we construct a multi-dimensional matrix/array for each vertex u of T, the construction of 
which is computed in a recursive manner, as for the previous 2x2 matrices M(ri) and N(ix). 

Specifically, for each vertex u we assign a d{u) x |B D C'|-fold array 


A(m) = A|(m) 


- fcsRnC', l<i<d{u) 


where A|(tt) is the maximum prize of a subtree of T^{u) containing kj edges of cost Cj for each 

j £ {1,... ,d} and each k £ BcC. For 0 = (0,... , 0), we have A^^{u) = p{u) for each vertex u for 
i = 1 ,..., d{u). 

Convention: For i £ {!,...,(!} and an edge e € E{T), let di{e) = where for every pair 
of rational numbers x,y £ Q 


\ 1 if X = y, 

1 0 otherwise 


is the Kronecker delta function. Further, let 6{e) = ((5i(e),... ,6d{e)). 

As in © and ([2]), we use the same decomposition of a subtree r of T into ti and r", and as 
with previous Lemmas 14.11 and 15.11 we have the following. 
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Lemma 6.3. The subtree r rooted at u is a maximum-prize subtree among those with ki edges of 
cost Ci for each i and that contains the leftmost child ui of u if and only if the included subtree of 
Ti is a maximum-prize subtree among those rooted at ui and with a* edges of cost ci for each i and 
the included subtree of t" is a maximum-prize subtree rooted at u among those that do not contain 
Ui and with fii edges of cost Ci for each i, for some a, fd ^ B r\ C, where a -\- (3 = k — 5{e{ui)). 

For a vertex u and an arbitrary subtree r rooted at u, we let be the maximum prize 

of a subtree of r rooted at u with ki edges of cost c* for each i € {1,..., d}. If a maximum-prize 
subtree of r with ki edges of cost Ci does not contain the edge from u to its leftmost child ui, then 
Ai^{u; t) = Aj^{u] t"). Otherwise, such a maximum subtree contains Oj edges of cost Cj from and 
Pi edges of cost c* from r", where ai Pi = Ci — 5{e{ui)) for each i G {1,... , d}. Finally, for each 
leaf u of T, each i, and k & B nC] we set A^{u) = p{u). As previously, we get by Lemma [631 the 
following recursion. 

Ar {u; r) = max i Ai{u;t"), max (Aa{ui;ri)-\-AA uit")) ] . (7) 

Y a+g=k-S(e(ue)) ^ ) 

Lemma 6.4. The evaluation of each A^^{u) takes at most 2{m/d -\- 1)'^ arithmetic operations. 

Proof. For each x = {xi,... ,Xd) G let 7r+(x) = nf=i(®* + !)• © each A^(u) requires 

7 r~^{k — 6{e{u£))) additions and 'K~^{k — S{e{ui))) comparisons, and hence all in all 27r~^{k — 5{e (ui))) 
arithmetic operations. 

By ([6|) we have that k&BnC'TBr) C", and hence, kj < nij for each j G {1,..., d}. Thus, by 
the lAGM, there are at most 

d d ^ 

27r+(fc - 6 {e{ui))) < 2 -|- 1) < 2 -|- 1) < 2 -\- 1^ 

j=i i=i 

arithmetic operations for evaluating each A^'^fu). □ 

Assuming each arithmetic operation takes one step, the total running time to evaluate the entire 
array A[u) is at most a constant multiple of 


Nd { n ) = 


d(u) 

ueV(T) keBnC 


E ‘iwK E + i 


\ueV{T) 


, , cm \d /m 

< 2|E(r)|(- + l) 2(- + l 


\keBf\C' 
m 

7 


4 („-l) (™ + l) 


2d 


We then obtain the desired maximum prize p{T') of a VCSAS T' by p{T') = max^g^^ij,, 

for the root r of T of our CSM M, which takes at most |.8 D C'| — 1 < {m/d -\- comparisons. 
Hence, we obtain the following. 
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Theorem 6.5. If M = (T, c,p, B, G) is a CSM where T has n vertices, m is given hy Detinition \6.1l 
and c : E{T) —)■ Q takes at most d distinct rational values, then the GOAS-OP can be solved in 
0 {rn^'^n)-time. 

Remarks: (i) Note that when d = 1, and hence ci = c, then m in Theorem 16.51 is given by m = 
mi = /ci\,n) = min(|'R/c] + l,n), whereas in Theorem 14. 21 m = \B/c\ = min(|'R/c],n), 

by the assumption that \B/c\ < n. Still, the complexity when d = 1 in Theorem 16.51 clearly 
agrees with the complexity of 0{m?‘n) for solving the GOAS-OP when c is a constant function in 
Theorem 14.21 (ii) If each m* = 0{f{n)), for some “slow-growing” function of n, then Theorem 16.51 
yields an 0(n/(n)^'^)-time algorithm for solving the GOAS-OP. In particular, if each mi = 0(1), 
then Theorem 16.51 yields a linear-time in n algorithm to solve the GOAS-OP. 

Corollary 6.6. The GOAS-DP when restricted to d rational-valued penetration costs can be solved 
in polynomial time. 

7 Summary and Conclusions 

This paper defined a new cyber-security model that models systems which are designed based on 
defense-in-depth. We showed that natural problems based on the model were intractable. We then 
proved that restricted versions of the problems had either polynomial time or pseudo-polynomial 
time algorithms. Table [U in Section [T] summarizes our results. They suggest that in a real system 
the penetration costs should vary, that is, although each level should be difficult to attack, the cost 
of breaking into some levels should be even higher. The tree representation of the models suggests 
that systems should be designed to distribute targets in a bushy tree, rather than in a narrow tree. 
Most security systems are linear, and such systems could be strengthen by distributing targets 
more widely, providing defense-in-deception. Although in most situations a cyber attacker will not 
a priori know exact penetration costs, target locations, and prizes, the model still gives us insight 
into which types of security designs would be more effective. 

We conclude the paper with a number of open questions. 

1. Can we quantify how much targets need to be distributed in order to maximize security? For 
example, does an (n -|- I)-ary tree provide provably better security than an n-ary tree? 

2. Can we prove mathematically that the intuition of storing high-value targets deeper in the 
system and having higher penetration costs on the outer-most layers of the system results in 
the best security? 

3. If targets are allowed to be repositioned periodically, what does that do to the complexity of 
the problems, and what is the best movement strategy for protecting targets? 

4. Using the model, can one develop a set of benchmarks to rank the security of a particular 
system? How would one model prizes in a system? 

5. Can the notion of time and intrusion detection be built into the model? That is, if an attacker 
tries to break into a certain container, the attacker may be locked out, resulting in game-over 
for that attacker, or perhaps may face an even higher new penetration cost. 

6. Are there online variants of the model that are interesting to study? For example, a version 
where the topology of the graph changes dynamically or where only a partial description is 
known to the attacker. 


16 


Acknowledgments 


This work was in part motivated by a talk that Bill Neugent of MITRE Corporation gave at the 
United States Naval Academy in the fall of 2011. We thank Bill for initial discussions about game- 
over issues relating to cyber-security models. Thanks also to Richard Chang for discussions about 
the model. - Finally, we like to thank the two anonymous referees for their careful reading of the 
paper, their pointed comments and suggestions which resulted in a greatly improved presentation 
of the results and made them more complete. 


References 

[1] El Houssaine Aghezzaf, Thomas L. Magnanti, and Laurence A. Wolsey. Optimizing Constrained 
Subtrees of Trees. Mathematical Programming, 71(2);113-126, Series A, (1995). 

[2] Geir Agnarsson and Raymond Greenlaw. Graph Theory: Modeling, Applications, and Algo¬ 
rithms, Pearson Prentice Hall, Upper Saddle River, NJ, (2007). 

[3] Robert G. Armstrong, Jackson R. Mayo, and Frank Siebenlist. Complexity Science Challenges 
in Cyber security, Sandia Report, March 2009. 

[4] Tania Branigan. “Chinese Army to Target Cyber War Threat.” The Guardian (Lon¬ 
don). WWW.theguardian.com/world/2010/jul/22/chinese-army-cyber-war-department, 
retrieved October 1, 2013. 

[5] Hayes Brown. “No Longer in the Shadows, Cyberwar’s Potential is now an Open 
Secret.” Think Progress. thinkprogress.org/security/2013/10/04/2699361/cyber- 
conf lict-just-over-the-horizon/, retrieved October 15, 2013. 

[6] Deepayan Chakrabarti and Christos Faloutsos. Graph Mining: Laws, Generators, and Algo¬ 
rithms. AGM Computing Surveys, 38(1), article 2, 69 pages, (2006). 

[7] Sofie Coene, Garlo Filippi, Frits Spieksma, and Elisa Stevanato. Balancing Profits and Costs 
on Trees. Networks, 61(3);200-11, (2013). 

[8] “2012 Cost of Cyber Crime Study; United States,” Ponemon Institute, research report, 
29 pages, October 2012. 

[9] Daniel M. Dunlavy, Bruce Hendrickson, and Tamara G. Kolda. Mathematical Challenges in 
Cyber security. Sandia Report, February 2009. 

[10] Michael R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory 
of NP-Completeness, W. H. Freeman and Company, New York, (1979). 

[11] Paul Goransson and Raymond Greenlaw. Secure Roaming in 802.11 Networks, Elsevier Science 
and Technical Book Group, (2007). 

[12] Raymond Greenlaw, H. James Hoover, and Walter Larry Ruzzo. Limits to Parallel Computa¬ 
tion: P-Completeness Theory, Oxford University Press, (1995). 

[13] Sun-Yuan Hsieh and Ting-Yu Ghou. Finding a Weight-constrained Maximum-density Subtree 
in a Tree. Algorithms and Computation, Lecture Notes in Computer Science, 3827:944-953, 
Springer, Berlin, (2005). 


17 



[14] Robert Johnston and Clint LaFever. Hacker.mil, Marine Corps Red Team (PowerPoint Pre¬ 
sentation). (2012). 

[15] Hoong Chuin Lau, Trung Hieu Ngo, and Bao Nguyen Nguyen. Finding a Length-constrained 
Maximum-sum or Maximum-density Subtree and Its Application to Logistics. Discrete Opti¬ 
mization, 3(4):385-391, (2006). 

[16] Christos H. Papadimitriou and Kenneth Steiglitz. Combinatorial optimization: algorithms 
and complexity, Prentice-Hall, Inc., (1982). 

[17] Shari Lawrence Pfleeger. Useful Cybersecurity Metrics. IT Professional, ll(3);38-45, (2009). 

[18] Rachel Rue, Shari Lawrence Pfleeger, and David Ortiz. A Framework for Classifying and Com¬ 
paring Models of Cybersecurity Investment to Support Policy and Decision-making. Proceedings 
of the Workshop on the Economics of Information Security, 23 pages, (2007). 

[19] Fred B. Schneider. Blueprint for a Science of Cyber security, The Next Wave, 19(2):47-57, 

( 2012 ). 

[20] Sajjan Shiva, Sankardas Roy, and Dipankar Dasgupta. Game Theory for Cyber Security. Pro¬ 
ceedings of the ACM 6*^ Annual Cyber Security and Information Intelligence Research Work¬ 
shop, article no. 34, April 21-23, (2010). 

[21] Paul Sparrows. Cyber Crime Statistics, hackmageddon.com, retrieved October 16, 2013. 

[22] Hsin-Hao Su, Chin Lung Lu, and Chuan Yi Tang. An Improved Algorithm for Finding 
a Length-constrained Maximum-density Subtree in a Tree. Information Processing Letters, 
109(2):161-164, (2008). 

[23] Jung Sung-ki. “Cyber Warfare Command to Be Launched in January.” Koreatimes.co.kr. 
www.koreatimes . CO .kr/www/news/nation/2013/07/205^6502 .html, retrieved October 1, 
2013. 

[24] William Jackson. “DOD Creates Cyber Command as U.S. Strategic Command Subunit.” 
Federal Computer Week, fcw.com, October 16, 2013. 


18 



